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Abstract 

The imprints left by quantum mechanics in classical (Hamiltonian) 
mechanics are much more numerous than is usually believed. We show 
that the Schrodinger equation for a nonrelativistic spinless particle is 
a classical equation which is equivalent to Hamilton's equations. Our 
discussion is quite general, and incorporates time-dependent systems. 
This gives us the opportunity of discussing the group of Hamiltonian 
canonical transformations which is a non-linear variant of the usual 
symplectic group. 



1 Introduction 

"Where did that [the Schrodinger equation] come from? Nowhere. 
It came out of the mind of Schrodinger, invented in his struggle 
to find an understanding of the experimental observations in the 
real world. " (Richard Feynman in ffjjj.) 

Similar statements abound in the physical literature; they are found in 
both introductory and advanced text on quantum mechanics, and we can 
read them on the web in various blogs and forums. However, they are strictly 
speaking not true; already in 1966 Nelson [23] showed that Schrodinger 's 
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equation could be derived from Newtonian mechanics; he however intro- 
duced some extra physical assumptions (stochasticity); also see the more 
recent paper by Hall and Reginatto [H] , who use the uncertainty principle. 
We also mention, because of its historical interest, Feynman's non-rigorous 
approach in [5] (see Derbes' comments [1] of Feynman's paper). 

In the present paper we will show that one can mathematically derive 
rigorously from Hamiltonian mechanics, the Schrodinger equation 

dib 

ih— = H(x, -KV x ,t)ip 

and this without any recourse to any physical 'quantum- mechanical' hy- 
potheses. In fact we will show that there is a surprising one-to-one and 
onto correspondence between Hamiltonian flows and the quantum evolution 
group, which only becomes apparent if one uses a deep property of symplec- 
tic covariance together with Stone's theorem on one-parameter groups of 
unitary operators. Schrodinger [28] was led to his equation from his knowl- 
edge of the classical Hamilton- Jacobi approach which has a close connection 
with the eikonal of classical wave theory. His original proposal was not firmly 
based on rigorous mathematics as he himself acknowledged but his intuition 
was correct. Attempts to provide a detailed relationship between classical 
and quantum mechanics has remained somewhat of a problem because we 
have been left with two widely different mathematical formulations, one in- 
volving particles evolving under dynamical laws in a phase space, the other 
involving operators and waves operating in a Hilbert space. This leaves the 
impression that there are two very different worlds, the classical and the 
quantum. However it is clear that we do not inhabit two different worlds 
and so we are left with the puzzle as how to relate the two formalisms? 

Already in studying the properties of light, we see in one phenomena both 
aspects. Geometric optics gives us light rays travelling in straight lines, while 
wave optics gives us interference and diffraction. In this phenomena we see 
the essential mathematics emerging. Ray optics emerges from symplectic 
geometry, while wave optics emerges from the geometry of the covering 
group of the symplectic group, Sp(2n,R), namely the metaplectic group, 
Mp(2n,R). Ray tracing involves symplectic flows, while the wave evolution 
is identified with the metaplectic flows. These are not separate flows but 
deeply related as the metaplectic flows can be mathematically 'lifted' from 
symplectic flows and the metaplectic flows can be projected onto symplectic 
flows. 

The fact that classical mechanics has the same mathematical structure 
as ray optics, namely the symplectic geometry, suggest that quantum me- 
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chanics in its wave description could be related to such a covering structure. 
Indeed if we restrict ourselves to quadratic Hamiltonians, we find that the 
metaplectic flow is determined exactly by Schrodinger's equation, in which 

Weyl 

the classical Hamiltonian is replaced by the Weyl Hamiltonian, H -ft H. 
Unfortunately the generalisation to all physically relevant Hamiltonians has 
always floundered on the mathematical difficulties involved in investigat- 
ing the covering properties of general group of canonical transformations 
which involve non-linear transformations. However such difficulties should 
not force us into a 'two world' situation. 

Indeed we have shown recently [UIO] that Hamiltonian mechanics, in its 
symplectic formulation, in a sense reproduces what is considered as being 
one of the hallmarks of quantum mechanics, the uncertainty principle in 
its strong form (the Schrodinger-Robertson inequalities); the argument is 
based on a deep and new property of canonical transformations, Gromov's 
[12] non-squeezing theorem, alias "the principle of the symplectic camel". 

Up to some technical assumptions on the Hamiltonian function ensuring 
us that the flows they generate exist for all times, we will in fact show that: 

There is a one-to-one and onto correspondence between Hamiltonian 
flows generated by a Hamiltonian H and strongly continuous unitary one- 
parameter groups satisfying Schrodinger's equation with Hamiltonian oper- 
ator H = H(x, —iftV x ,t) obtained from H by Weyl quantization. Equiva- 
lent^, the Hamilton equations 

x = V p H(x,p,t) , p = -V x H(x,p,t) 

are mathematically rigorously equivalent to Schrodinger's equation 

dtp 

ih— = H(x, -ihV x ,t)ip. 

This property is actually well-known (at least in mathematics 0[13]) for 
the linear flows arising from quadratic Hamiltonian functions; the new result 
is that this surprising correspondence between Hamiltonian and quantum 
flows has a quite general nature. 

The proof relies on two deep mathematical results 

• The first is that the Weyl correspondence is the only quantization 
(that is, pseudo differential calculus) which is symplectically covari- 
ant. We emphasize the word "only", because while it is reasonably 
well-known, both in mathematics and physics that Weyl operators are 
symplectically covariant, the converse, namely the only operators that 
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are symplectically covariant are Weyl operators, is largely ignored in 
the literature; this uniqueness is the key to our argument. 

• The second is Stone's theorem from functional analysis which con- 
cerns the infinitesimal generators of strongly continuous unitary one- 
parameter groups. It states that each such one-parameter group de- 
termines a self-adjoint operator. 

In Section 12.21 we will define and study the group Ham(2n, M) of all 
Hamiltonian canonical transformations following Banyaga [2] and Polterovich 
[24] ; the systematic consideration of general time-dependent Hamiltonians 
will make our task considerably easier than if we had limited ourselves to 
the case where H only depends on the phase space variables, z = (x,p). 

The hypothesis of a bounded phase space (implying a fortiori a bounded 
Universe) is not essential; it is added as an ad hoc assumption in order to 
avoid mathematical difficulties related to self-adjointness and the fact that 
Hamiltonian vector fields need not be complete. 

There are related issues that will not be addressed in this paper. For 
instance, one could envisage extending the results of this paper to the 
Schrodinger equation on Riemannian manifolds (compact or not). Since our 
main result links Hamiltonian flows with quantum flows, one might wonder 
if the control of the former allows to derive some control of the other. This 
is a question that certainly deserves to investigated, but we will leave it 
to forthcoming publication. It would be also interesting to discuss recent 
results of Schmelzer [251 EZ] about foundational issues in our context. 

CAVEAT LECTOR: We are not claiming that we are deriving quan- 
tum mechanics from classical mechanics; what we are doing is the following: 
knowing that quantum mechanics exists, we show that the mathematical for- 
mulation of quantum mechanics in its Schrodinger formulation lies within 
Hamiltonian mechanics. This does not imply that quantum mechanics -as a 
physical theory- can be reduced to classical mechanics. For as Mackey (who 
used to see quantum mechanics as a refinement of Hamiltonian mechanics) 
stresses in [20] (p. 106), quantum mechanics is not just an algorithm for at- 
taching a self-adjoint operator to every classical Hamiltonian, because such 
a program would overlook many facts: first, quantum mechanics rules out a 
large number of conceivable Hamiltonians, and secondly there are features 
of quantum mechanics (such as spin) which do not manifest themselves in 
the classical limit. However, adds Mackey, one can argue convincingly that 
classical mechanics must have independent existence since it is only what 
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quantum mechanics looks like under certain limiting conditions. Our pro- 
gram does not contradict Mackey's insight because our approach shows that 
the imprint left by quantum mechanics in classical mechanics is more than 
could be expected at first sight from the limiting conditions. 



2 The Dynamical Groups Sp(2n, R), Mp(2n, R) and 
Ham{2n,R) 

2.1 Motivation from Optics 

We recall that ray tracing in Gaussian optics uses an identical formulation 
of classical mechanics, which for the purposes of this paper can be written 
in the form 

= ft H M 

where the phase variables z = {x\ . . . x n ,pi . . . p n ) and is the flow deter- 
mined by Hamilton's equations of motion 

x = V p H(x,p) , p = -V x H(x,p). (1) 

where H(x,p) is some physically relevant Hamiltonian. 

We now ask the question, "Is it possible to lift this flow onto the covering 
group of Sp(2n,M) so as to display possible wave properties of the symplec- 
tic flow?" The answer is, "Yes, provided we restrict ourselves to quadratic 
Hamiltonians." 

To see how this works let us write the quadratic Hamiltonian in the form 
H(z, t) = \z T M(t)z = \{^j T M(t) (fj (2) 

where M (t) is a 2n x 2n symmetric matrix depending in a C°° fashion on 
the parameter t. Let us now shortly describe the metaplectic representation 
of the symplectic group Sp(2n,R). Let W be a real quadratic form of the 
type 

W(x, x ) = \Px ■ x — Lx ■ x + \Qx ■ x (3) 

with P = P T , Q = Q T , and detL ^ 0; we are writing Px ■ x for x T Px, 
etc. The function W has a very precise meaning in optics, where it is 
called the eikonal (see Chapter 1 in Guillemin and Sternberg's book [13] 
for a discussion of the relation between physical optics and the metaplectic 
group); it is also well-known in Hamiltonian mechanics, where it is called a 
generating function, or Hamilton's two-point function and is closely related 
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to action (see de Gosson 18] and the references therein) . To such a function 
W one associates a linear canonical transformation s w by the formula 

(x,p) = s w (x>,p>); (4) 

explicitly s w is then identified with the symplectic matrix 

L~ X Q L- 1 
PL- l Q-L T L- l P 



-in _ /- T r-ipl ' ( 5 ) 



One shows (de Gosson [8]) that every symplectic matrix s can be written 
(non-uniquely) as a product of two such matrices: s = s w s w hence the s w 
generate Sp(2n, R). Denoting by rrnr the choice of an argument for detL 
modulo 27r we define the Fourier quadratic transformation S w by 

S w Mx) = (-!—) ' i m J\fetL\ [ e^ w ^ x '^(x')dx' (6) 
\2mhJ J R u 

where m is the Maslov index (see e.g. de Gosson [8], Chapter 7). One 
easily checks that the operators S w are unitary isometries of L 2 (M n ) and 
that the inverse of S is the operator S w where W*(x, x') = —W(x', x) (it 
amounts replacing the triple (P, L, Q) in @ by (— Q, —L T , —P)); notice that 
the choice W(x, x') = x ■ x' leads to the usual Fourier transform up to the 
factor i~ n l 2 and that we then have s w = J (the standard symplectic ma- 
trix). The operators S w thus generate a group of unitary operators acting 
on the square-integrable functions; this group is precisely the metaplectic 
group Mp(2n,R). It is closed related to the symplectic group; one shows 
that it is a true unitary representation of the double covering Sp 2 (2n,lR) 
of Sp(2n,R), and that the mapping n Mp : Mp(2n,M) — >Sp(2n,M) defined 
by n M P(F iy ) = f w where f w is defined by @ is two-to-one. The ho- 
momorphism Sp 2 (2n,M) w Mp(2n,M) allows us to associate to the curve 
t i — > ft in Sp(2n,M) a unique curve t i — > Ft in Mp(2n,M) such that Fo is 
the identity operator. This means in terms of the projection that the re- 
lation n Mp (i ? i w/ ) = unambiguously define a one-to-one correspondence 
between continuous curves in Sp(2ra,R) passing through the identity and 
continuous curves in Mp(2n,R) passing through the identity. 

Now choose a function tpQ in some subspace of infinitely differentiable 
functions in L 2 (]R n ) and set tpt = Ft^o where (Ft) is the curve in Mp(2n,R) 
associated by the procedure outlined above to a Hamilton flow (f t ) deter- 
mined by a quadratic Hamiltonian function We have ifit £ L 2 (M. n ) for each 
t and the function ifa satisfies the Schrodinger-like equation 

l ~dt = (7) 
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where H is a partial differential operator obtained from H using the Weyl or- 
dering. That is, we perform the substitution p — > —iV x in the Hamiltonian 
([2]) after having written it in the form 

H(z, t) = -A(t)x ■ x + B(t)p ■ x + B T {t)x ■ p + -C(t)p ■ p 

where M(t) = (gT^ c(t)) with A ® = and C ® = Using 

the obvious identity BV X ■ x = x ■ BV X + i Tr B we thus obtain 

H = ^A(t)x -x-ix- B(t)V x + ho{t)V x -V x - l -TiB(t) 

from which it is easy to verify that the operator H is symmetric (in fact 
self-adjoint) for the 1? scalar product. One formally writes 

H = H(x,-ihV x ,t). (8) 

which is obtained from H by the substitution pj — > —ihd/dxj; it is of 
course formally the usual "quantization" of the Hamiltonian function H of 
standard quantum mechanics. 

It looks as if we have gained nothing by the process we have gone through, 
but this conclusion is not correct. We have shown that there is a deep 
relation between classical and quantum mechanics in this instance. It is the 
isomorphism Sp 2 (2n, R) ~ Mp(2n, M) that allows us to associate the classical 
flow with the quantum flow . In this sense we have been able to derive 
the Schrodinger equation from the classical equations of motion. 



2.2 The Group Ham(2n, M) 

The group Ham(2n, R) of Hamiltonian canonical transformations is the non- 
linear analogue of the symplectic group Sp(2n,M). What we would be look- 
ing for if we are to generalise the quadratic Hamiltonian result is a unitary 
representation of some covering structure of Ham(2n). This possibility has 
not been explored in the physical literature (to the best of the knowledge 
of the present authors). It is the purpose of this paper to show how these 
results can be extended to all physically relevant Hamiltonians. To this 
aim, the next two sections discuss the symplectic covariance of Hamiltonian 
mechanics and the corresponding symplectic covariance of Weyl quantisa- 
tion. It is through these two notions that we can show how the Schrodinger 
equation is directly obtained from classical Hamiltonian mechanics. 
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3 Symplectic covariance of Hamiltonian mechan- 
ics 



Let H now be an arbitrary Hamiltonian function, that is, a function H(x,p, t) 
which we assume to be at least once continuously differentiable in the vari- 
ables aci, ...,x n ;pi, ...,p n and t. The flow (//^) determined by these equations 
(one also says "generated by if") consists of the mappings //* : R 2n — > 
R 2n which associate to a point z = (x,p) at initial time t = the value 
zt = (xt,pt) of the solutions at time t of the differential equations 

x = V p H(x,p,t) , p= -V x H(x,p,t). (9) 

Thus, by definition, zt = f^(zo). Generically the flow is only defined in a 
neighborhood of t = 0; we will later see that the consideration of compactly 
supported Hamiltonian functions eliminates these difficulties. 

One of the problems we face is that for time dependent Hamiltonians, 

H(t) 

the corresponding flows f t are such that 

f H(t) f H(t') , f H(t+t') 
h h T Jt+t' 

and therefore does no have the required group property. In subsection 2.3 
we will show how a group Ham(2n, R) can be constructed. In the mean time 
we note that when H does not dependent on time: dH/dt = 0, the system 
of differential equations ([9]) is autonomous and the flow (/ t ) satisfies the 
one-parameter group property //* / t , = fE t i whenever the mappings f^ 1 , 
f#, and f^_ t , exist. 

A fundamental property of Hamiltonian flows is that each ft is a canon- 
ical transformation (or symplectomorphism) . This means that for every 
z = (x,p) the Jacobian matrix 

is symplectic, i.e. Df^{z) G Sp(2n,R). (In some texts the term "canonical 
transformation" refers more generally to mappings that preserve the form 
of Hamilton's equations; the definition we use here is thus more restrictive). 
The simplest proof of this property consists in showing that the matrix 
St = Df^(z) satisfies the first order matrix equation 

^-S t = JH"(zt,t)S t (10) 
at 



8 



where H" = D 2 H is the Hessian matrix of the Hamiltonian, i.e. the matrix 
of its second derivatives in the variables x and p (see [8], §2.3.2) for the 
derivation of the equation (jlOp . which is sometimes called the "variational 
equation" in the literature). To show that St is symplectic, we first set 
At = (St) T JS t . It follows from this equation that 

dt dt v 1 dt 

= {St) T H"{z t )St - (St) T H"{z t )S t 
= 

hence At is constant. Thus At = Aq = (Sq) t JSo = J which shows that 
St € Sp(2n,R). 

Let now s be an arbitrary symplectic matrix; by the chain rule, the Jaco- 
bian matrix of sf^ s~ l is sS^s -1 hence it is also a canonical transformation; 
it is in fact the flow of a certain Hamiltonian. Let us now construct this 
Hamiltonian: 

Proposition 1 The family of canonical transformations sfp s -1 is the Hamil- 
tonian flow determined by the function K(z,t) = H(s~ 1 z,t). That is, 

8f*8- X = f* ,K(Z)=H(S-'Z). (11) 

Proof. Writing Hamilton equations for K as z = JV z [H(s~ 1 z, t)j we get, 
using by the chain rule 

V z [H(s- 1 z)] = (s T )- 1 (V z H)(s- 1 z) 

hence, since sJs T = J because s is symplectic, 

i = J(s T )- 1 {V z H)( S - 1 z) = aJ(V*ff)(s -1 z)]. 

Thus s~ 1 z(t) is the solution of Hamilton's equations for H with initial datum 
s~ l z(0); it follows that //^(s -1 ^) = s~ l ft < (z) which is equivalent to formula 

(dm. ■ 

We will call the property just proved the symplectic covariance property 
of Hamiltonian mechanics. It will be generalized in the next subsection to 
the nonlinear case, where we study products of Hamiltonian flows. Formula 
(jlip remains true if we replace s by an arbitrary symplectic transformation 
/. The result is actually well-known in standard Hamiltonian mechanics in 
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the following form: set (x',p f ) = f(x,p) and K = H o /; if / is a canonical 
transformation then we have the equivalence 

x = V p iK(x ,p' \t) and p = —V x 'K(x',p r ,t) 

«=* (12) 
x = V p H(x,p,t) and p = —V x H(x,p,t). 

(see de Gosson [8], §2.3.2, for a proof). Let Xh the Hamilton vector field of 
H; by definition 

X H (z,t) = (V p H(z,t),-V x H(z,t)). 

Hamilton's equations are thus equivalent to z = Xjj(z, t). Note that Xjj(z, t) 
is not strictly speaking a vector field on M. 2n because of the dependence in 
t. In terms of Xh we can restate the theorem above as the transformation 
law 

X Hof (z) = [Df(z)r 1 (X H of)(z) (13) 
for Hamilton vector fields. 

3.1 Operations on Hamiltonian Flows 

We assume as before that all Hamiltonians have flows defined for all values 
of time t. 

Proposition 2 Let (ff 1 ) and be Hamiltonian flows. Then: 

ft H ft K = ft H * K with H#K(z,t) = H(z,t)+K({f t H )- 1 (z),t). (14) 
(ft H r l = ft K ™th K(z,t) = -H(f t H (z),t). (15) 

Proof. Let us first prove (|14|). Using successively the product and chain 
rules, we have 

J t (ft H ft K ) = (j t ft H ) ft K + ( D ft H )ft K J t ft K 

= X H (f t H f t K ) + (Df t H )f t K o X K (f t K ) 
and it thus suffices to show that 

(Df{ I )ffcX K (fF) = X Ko(ffyl (f t K ). (16) 

Writing 

(Df t H )f t K oX K (f t K ) = (Dff)((f t H )- l f t H f t K )oX K ((f t H )^f t H f t K ) 
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the equality f)16|) follows from the transformation formula f|13|) for Hamilton 
vector fields. Formula (|15p is now an easy consequence of ()14|) . noting that 
if K is given by the second formula (|15|) then (ft 1 ft^) is the flow determined 
by the Hamiltonian 

#CM) = H(z,t)+K((f t H )- l (z),t) = 0; 

ft ft i s thus the identity, so that (ft 1 )' 1 = as claimed. ■ 

The formulae above show why we cannot avoid considering time-dependent 
Hamiltonians: even if H and K do not depend explicitly on time, the prod- 
uct and inverse of their flows is generated by time-dependent Hamiltonians! 
This remark will be important below when we define the group of Hamilto- 
nian canonical transformations. 

3.2 The group of Hamiltonian canonical transformations 

From now on we assume that the solutions for the Hamilton equations with 
arbitrary initial data at time t = are uniquely determined by these data 
and exist for all times. This may seem to be a strong technical restriction, 
but it can actually easily be implemented (see e.g. Polterovich [24] ) . If 
H is such that is not defined for all values of t, we just replace H by 
the function HQ where O = Q(x,p) is a compactly supported infinitely 
differentiable function equal to one on some arbitrarily chosen subset Q of 
phase space. 

Now proposition [2] allows us to define the group Ham(2n,IR) of Hamil- 
tonian canonical transformations. To do this, let us introduce the following 
terminology. We will say that a canonical transformation / of HL 2n is Hamil- 
tonian if it is the time-one flow of some Hamiltonian function, i.e. if there 
exists a (time-dependent) Hamiltonian H such that we have / = fj^i- Note 
the set of all Hamiltonian canonical transformation of M. 2n is denoted by 
Ham(2n,M). The choice t = 1 is arbitrary, and can be replaced by any 
other time to, as a consequence of the following simple observation: de- 
fine the Hamiltonian function K(z,t) = toH(z,tot); the flow it determines 
is given by ffj to = ft ■> hence f^ = fi is also a Hamiltonian canonical 
transformation. 

It follows from Proposition [2] that Ham(2n, R) is indeed a group for the 
product operation: first, the product is clearly associative (because compo- 
sition of mappings is). Assume that H is a constant; then / t is the identity 
/ on M. 2n and hence / is in Ham(2n,R). Suppose now / and g are two 
Hamiltonian canonical transformations: / = f£L 1 and g = f^-i- In view of 
formula (I14D we have fg = fS[ hence fg is also a Hamiltonian canonical 
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transformation. That the inverse / 1 also is in Ham(2n,R) follows similarly 
from formula (|15p . 

3.3 The key role of Banyaga's Theorem. 

The group Ham(2ra,R) contains the symplectic group Sp(2ra,R) as a sub- 
group. Equivalently and more important for us here: every symplectic ma- 
trix is the time-one flow of some Hamiltonian function. This is obvious if 
s is in the range of the exponential exp : sp(2n,R) — > Sp(2n, R), writing 
s = exp(X): the formula St = exp(tX) then defines a one-parameter group 
of symplectic matrices, which is the propagator for the Hamilton equations 
for H(z) = —^JXz ■ z. The general case easily follows by induction since 
every symplectic matrix can be written as a product exp(-Xi) • • ■ exp(Xk) 
where Xi, ...,Xk are in the Lie algebra sp(2n,R). There is actually a more 
constructive way to prove this result as a particular case of the following 
far-reaching property of Ham(2n,R), proven in full generality by Banyaga 
[2] in the context of symplectic manifolds: 

Theorem 3 (Banyaga) Let (ft) be a family of Hamiltonian canonical trans- 
formations depending smoothly on the parameter t, and such that fo = I (the 
identity). Then ft = ft 1 for some Hamiltonian function H(z, t) given by the 
formula 

H (z, t) = H(z, 0) - / cr(X(uz,t),z)du (17) 
J o 

where X = (A ft) and a is the symplectic form a(z,z') = (z') T Jz = 
Jz ■ z' on the phase space R 2n . 

Let us sketch the proof (for a complete proof see Polterovich [23] or 
Banyaga's original paper [2]). One begins by noting that if Xh is a Hamil- 
tonian vector field, one can reconstruct H by the following method: first 
write 

f 1 d 

H(z,t) = H(z,0) +/ —X H (uz,t)du 
Jo du 

= H(z,0)+ [V z H(uz,t) ■ z]du 
Jo 

(the second equality in view of the chain rule). Next observe that since 
V z H(uz,t) = — J 2 V z H(uz,t) = —JXH(uz,t), we have 

H(z, t) = H(z, 0) - [ a(X H (uz,t),z)du 
Jo 
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where a is the standard symplectic form. One then proves the proposition 
by setting X = [4tft) ft' 1 and showing that the flow determined by the 
Hamiltonian functions is precisely (ft)- 

This result is remarkable, and even in some sense surprising, because it 
shows that the datum of a family of time-one Hamiltonian canonical trans- 
formations, each coming from a different Hamiltonian function, is itself the 
flow of some Hamiltonian. 

This has several non-trivial consequences. First, it shows that the group 
Ham(2n, M.) is arcwise connected; secondly it immediately implies that 
Sp(2n,M) C Ham(2n,IR): let s € Sp(2n,R) and choose a path (s t ) joining 
the identity to s in Sp(2n, R). Then X = (■jgSt) s^ 1 belongs to the symplec- 
tic Lie algebra sp(2n,R); in particular it is a matrix so that formula (JT7J) 
defines a quadratic Hamiltonian whose associated flow is precisely (st)- 

3.4 Reduction to the time-independent case: the extended 
phase space 

It turns out that the study of time-dependent Hamiltonian function can 
always be reduced to that of Hamiltonians which do not explicitly contain 
time. As we will see, the price we have to pay for this is that we have to 
work in a phase space with dimension n + 2 instead of n. Thus, in a sense, 
there are no time-dependent Hamiltonians! 

The trick is the following, which seems to go back to |33|. [T6] (see [32] 
for an analysis in depth). We define a new Hamiltonian function H by the 
formula 

H(x,p, t, E) = H(x,p, t) — E (18) 

where E is a new variable, viewed as conjugate to the time t (the latter now 
has the status of a "position variable" ; we could as well write the definition 
of H in the form 

H(x,p,x n+1 ,p n+1 ) = H(x,p,x n+ i) -Pn+i 

but we will however stick to the notation (|18p if only for economy. The 
function H is defined on the extended phase space M 2n+2 = M 2 ,^, x lg x R ( 
and the associated Hamilton equations are, expressed in terms of the original 
Hamiltonian H: 

-V X H 

1 



dE dH 



df 



dt 



dp 
dP 

dt_ 
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where the parameter t' plays the role of a new "time" ; since H does not con- 
tain explicitly that parameter it is indeed a "time-independent" Hamiltonian 
on extended phase space. Notice that in view of the fourth equation above 
we may choose t' = t so that the two first equations are just the Hamiltonian 
equations for H; as a bonus the third equation is just the familiar law for 
the variation of energy of a time-dependent Hamiltonian system: 

dE d . . d . . 
— = -H(x,p,t) = -H(x,p,t) 

(the second equality because of the chain rule and using the fact that x and 
p satisfy Hamilton's equations). 

We can now define the "extended Hamiltonian flow" of H by the 

formula //* = f^ 1 . Notice that since (ft 1 ) is the flow determined by a time- 
independent Hamiltonian (ft 1 ), it enjoys the one-parameter group property 
ft ft/ = ft+t' an< ^ fo 1 = I (t ne identity operator on the extended phase 
space R 2n+2 ). 

Denote now by (f^ t <) the two-parameter family of canonical transforma- 
tions of R 2n defined as follows: for fixed t' the function z = fj* t ,(z') is the 
solution of Hamilton's equations for H taking the value z' at time t' . Clearly 
ft H t is the identity operator on E 2n and f t H t ,f t ¥ t „ = f t H t „, (f^)' 1 = f"r The 
two-parameter family (f^ t >) ls sometimes called the "time-dependent flow"; 
it is related to the extended flow defined above by the simple (and useful) 
formula 

f t H (z',t',E') = (f t H t ,(z'),t + t',E t , t ,) (19) 

with 

Et,t' = E' + H(ft H t,(z'),t)-H(z',t>). (20) 

We will see below that the procedure above extends, suitably modified, to 
the case of the Schrodinger equation. 

4 The Weyl Formalism and Schrodinger 's Equa- 
tion 

In classical physics, observables are taken to be functions on phase space, 
limited by whatever conditions of regularity that are required by the physics 
of the problem. A class of observables having been chosen, a state is a 
continuous functional on this class (the continuity being imposed for some 
specified topology). We will cast a wide net and accept as observable not 
only functions, but any tempered distribution on the configuration space 
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R d ; the space of such distributions is traditionally denoted by <S'(R n ). It is 
the dual of the Schwartz space <S(R n ) of all complex functions defined on 
R n which decrease, together with their derivatives, faster than the inverse 
of any polynomial in the variables x\, ...,x n . 

An observable will usually be denoted by a small Latin letter, such that 
a, 6,...; tradition provides us with the exception confirming the rule: when 
an observable is viewed as a Hamiltonian function it will always be denoted 
by H, K, etc. 

4.1 Pseudodifferential operators 

We begin by briefly discussing some well-known aspects from the theory 
of pseudodifferential operators (see for instance Shubin [29]). Let A be a 
linear operator 5(lR n ) — > S'(R n ). If we assume that A is continuous from 
S(R n ) to <S'(IR Tl ) (when <S'(M n ) is equipped with the weak-* topology, that 
is the topology of pointwise convergence) then a deep result from functional 
analysis, Schwartz's kernel theorem, tells us that there exists a distribution 
K A in S'(R n x R n ) such that for all ip,<f> in S(R n ), we have (Ai/>,4>) = 
((K a ,i/j <8> </>*)) where (•,•) and ((•,•)) are the distributional brackets for 
S'{R n ) and S'(R n x R n ), respectively (see Grochenig [II], §14.3, for an 
elegant proof of the kernel theorem in the broader context of Feichtinger's 
modulation spaces). We will informally write 

Ail){x)= [ K A (x,y)i>(y)dy (21) 

where the integral should be interpreted in the distributional sense. 

Let now r be an arbitrary fixed real number and define a distribution 
a T (x,p) by the formula 

a T (x,p)= [ e--n py K A {x + Ty,x-{l-T)y)dy- (22) 

where we have introduced a scaling factor which for convenience, we have 
called h. This ensures that the exponential is dimensionless. The magnitude 
of this factor is left open in this paper since we are concentrating only on 
mathematical relationships. 

Using the Fourier inversion formula, the kernel is then expressed in terms 
of a by 

K A (x,y) = {^ E ) n f e^-^a T ((l-T)x + ry,p)dp. (23) 
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It follows that formula (|2ip can be rewritten in pseudodifferential form as 
M*) = {ikY 1 1 e^^VCCl - t)x + ry,p)^{y)dydp (24) 

Thus we have shown how to associate an operator A with an observable a T . 

Now we can use formula (|24p to associate to given observable a with an 
operator A T for every value of the constant r: by definition 

A M*) = {m)" / / eW*-w)a((l - t)x + ry,p)^(y)dydp (25) 

the "double integral" being again interpreted in some reasonable sense (for 
instance as a repeated distributional bracket; it is actually no restriction in 
our discussion to assume that the observable is in 5(R 2n ) in which case the 
right hand side of (|25j) strictly makes sense since the integral is absolutely 
convergent). 

Let us dignify the discussion above in compact form as a Theorem: 

Theorem 4 Every continuous linear operator A : <S(R n ) — > S'(M n ) can be 
written in pseudodifferential form (24\ ), and this for every real value of the 
parameter r. 

The (admittedly somewhat technical) discussion above shows that for 
each value of r we have a bijective (= one-to-one and onto) correspondence 
a < — > A T between observables and continuous operators <S(IR n ) — > 5'(K n ) 
given by (|25|) . We emphasize again that each choice of r is a priori equally 
good; for instance if we take r = then we get 

A M%) = {^rf [ [ eiP< x -^a(x,p)ip(y)dydp 

J JR n xM. n 

= / efr*a{x,p)F1>{p)dy 

[Ftp the /i-Fourier transform) which is the standard pseudodifferential rep- 
resentation widely used in partial differential equations. There is however 
a privileged choice in quantum mechanics: it consists in taking t = \- The 
corresponding operator -Ax/2 wm De denoted A and called the Weyl operator 
corresponding to the observable (or symbol) a. Explicitly 

&I>1?) = {^nT [ [ ete<*-ria(±(x + y),p)^{y)dydp 
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and a few elementary manipulations show that this equivalent to the follow- 
ing expression 

M*) = i^nT I a a {z')T(z')^{x)dz' 
which is more familiar to physicist; here 




(the symplectic Fourier transform of a) and T(z') is the Heisenberg-Weyl 
operator: 

f{z')^{x) = e^ p '- x -y- x 'H{x - x'); 

see Littlejohn [17J for an analysis of these operators of which we have given 
a detailed study in [S] . 

The Weyl correspondence associates in a unique way to every observable 
a 6 <S'(IR n ) a continuous linear operator A whose domain always contains 
<S(R n ) and whose range is contained in S'(M. n ). This correspondence actu- 
ally is not only one-to-one, but also onto: every continuous linear operator 
A : S(R n ) — > S'(M. n ) is associated to an observable a via the Weyl corre- 
spondence. We will symbolically write the Weyl correspondence between an 
observable a and the corresponding operator A is a A or A a. (In 
the mathematical literature a is called the Weyl symbol of A ) . 

The Weyl correspondence has the following very agreeable feature, which 
distinguishes it from other r-correspondences, and which already motivated 
Weyl's interest: the transition to the adjoint operator is particularly simple. 

Weyl ^ _ Weyl ^ 

If a -ff A then a ft A* . In particular, if a is a real observable then A 
is formally self-adjoint. Thus the Weyl correspondence associates to real 
observables formally self-adjoint operators. This is one of the reasons why 
the "Weyl ordering" was adopted at an early stage (but more or less explic- 
itly) in quantum mechanics. The second reason is that Weyl operators are 
characterized by a symplectic covariance property, an important character- 
isation for the discussions in this paper. We will now discuss the details of 
this symplectic covariance. 

4.2 Symplectic covariance of Weyl calculus 

The fundamental result, which is at the very heart of our derivation of 
Schrodinger's equation in next subsection is the following: 
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Theorem 5 Let £(5(IR n ), 5'(]R ra )) be the space of all continuous linear map- 
pings S{R n ) — ► S'{R n ). Let 

M : S'(R 2n ) — > £{S(R n ),S'(R n )) 

be a continuous mapping associating to each observable a a continuous op- 
erator A4(a). If M. has the two following properties: 

(i) If a = a{x) then M(a) is the operator of multiplication by a(x); 

(ii) For every s in Sp(2n,M) we have M(ao s^ 1 ) = SA4(a)S^ 1 where S 
in Mp(2n,M) is such that tt Mp (S) = s then A4 is the Weyl correspondence, 

,1 i ■ Weyl . . , 

that is a i — > A4{a). 

Note that if a O A then we have a o s~ < — > SAS~ . this is the well- 
known property of symplectic covariance of Weyl calculus; we have given 
a detailed proof of this property in jSj, §7.1.3. There is also a proof in 
Littlejohn p7] . 

The real impact of the theorem above is that it says that, conversely, 
among all possible pseudodifferential correspondences a < — > A T between 
observables and continuous operators (I25D . the Weyl correspondence is the 
only one enjoying the property of symplectic covariance. The proof of this 
result is alluded to in Stein's book [30] (pp. 578-579) and proven in detail 
in the last Chapter of Wong's book [33]. Wong's proof actually relies on the 
following additional assumption (which is also found in Stein's statement): 
the mapping M. should associate to every essentially bounded observable a 
which depends only on x, the operation of multiplication by a: if a = a(x) 
then Q(a)4> = arp. However this property is automatically satisfied by the 
operators A T : we have 

A M%) = (^nT 1 1 er^-y) a ((l - t)x + Ty)tfj(y)dydp 



\2irh) 



a((l - t)x + ry)i/j(y)dy 



5{x - y)a{{\ - t)x + Ty)^j(y)dy 
a(x)ijj(x). 
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5 Derivation of Schrodinger 's equation 



5.1 Statement of the main result 

Now we come to the main result in the paper, namely, to show that the 
Schrodinger equation can be derived from Hamilton's equations of motion 
by lifting the flows of Ham(2n,R) onto a unitary representation of a 
covering structure. 

In order to motivate our approach let us now briefly return to the prop- 
erty of the metaplectic representation of Sp(2n, R) which shows that to every 
family (st) of symplectic matrices depending smoothly on t and such that 
so = I, we can associate, in a unique way, a family (St) of unitary operators 
on L 2 (R n ) belonging to the metaplectic group Mp(2n,R) such that So = 1 
and S t S t > = S t+t > . 

We have seen from Banyaga's theorem, that for every flow (st) there 
exists some Hamiltonian H, so that we can write (st) = {ft 1 )- Now if we 

use the correspondence H H we have (St) = (F^ 1 ) where 

S t = e- lSt ' n 

which is the solution of the Schrodinger equation 

ih—St = HSt 
dt 

Here we have again introduced a scaling parameter having the dimensions of 
action which, for convenience, we have denoted by h. Note that generically 
H is not a bounded operator on L 2 (R n ) so that the exponential has to be 
defined using some functional calculus (see Reed and Simon [25] §VIII.3 for 
a discussion of these technicalities). 

We now ask whether this property has an analogue for paths in the group 
Ham(2n,R) of Hamiltonian canonical transformations. That is can we find 
a (F^ 1 ) corresponding to (ft 1 ) for every physically relevant Hamiltonian. To 
sharpen up the discussion, let us introduce the following notation: 

• Denote by "PHam(2n,R) the set of all one-parameter families (ft) in 
Ham(2n,R) depending smoothly on t and passing through the iden- 
tity at time t = 0. As we have remarked above, such a family of 
canonical transformations is always the flow (ft) of some (usually 
time-dependent) Hamiltonian H (Banyaga's theorem). 

• Denote by VU (I? (R n )) the set of all strongly continuous one-parameter 
families (Ft) of unitary operators on L 2 (R n ) depending smoothly on 
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t and such that Fq is the identity operator, and having the following 
property: the domain of the infinitesimal generator H of (Ft) contains 
the Schwartz space 5(M n ). 

Recall that strong continuity for a one-parameter group (Ft) means that 
we have 

lim F t iP = F t ^ (26) 
t yto 

and the infinitesimal generator of (Ft) is the the operator defined by 



a = in -F t ^ 

at 



= ih lim FAt f ~ t (27) 
t=Q At->0 At 



for every vp £ L 2 (M n ) and every real number to] formally Ft = e ihA / t . 
Formally we have Stone's theorem |31j : 

Theorem 6 (Stone) For every strongly continuous one-parameter group 
(Ft) of unitary operators on a Hilbert space H there exists a self- adjoint 
operator A on L 2 (IR n ) such that Ft = ^ tA l h - in particular A is closed and 
densely defined in H. Conversely, if A is a self-adjoint operator on rl then 
there exists a unique one-parameter unitary group (Ft) whose infinitesimal 
generator is A, that is Ft = e ltA / h . 

For self-contained proofs we refer Reed and Simon [25], §VIII.4, or to 
Abraham et al. [1], Supplement 7.4B, pp. 529-535. 

We will now use Stone's theorem to prove the hard part of our main 
result: 

Theorem 7 There exists a bijective correspondence 

C : VU(L 2 (R n )) HPHam(2n,I) 

whose restriction to families (st) of symplectic matrices reduces to the meta- 
plectic representation, and which has the following symplectic covariance 
property: for every (ft) in 7 3 Ham(2n,M) and for every s in Sp(2n,M) we 
have 

C^fts- 1 ) = (SFtS- 1 ) (28) 

where S is any of the two operators in Mp(2n, R) such that tt Mp (S) = s. 
This correspondence C is bijective and we have 

ihjFt = HF t . (29) 

^ Weyl 

where H < — > H , the Hamiltonian function H being determined by (ft)- 
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It is perhaps worth observing that it is always preferable to take the fam- 
ily (Ft) as the fundamental object, rather than H (and hence Schrodinger's 
equation). This remark has already been made by Weyl who noticed that 
(Ft) is everywhere defined and consists of bounded operators, while H 
is generically unbounded and only densely defined (see the discussion in 
Mackey [20] for a discussion of related questions). 

5.2 Proof: the time-independent case 

Let us begin with the case where (ft) is the Hamiltonian flow determined 
by a time-independent Hamiltonian function H = H(x,p), in which case 
(ft) = (ft 1 ) i s a one-parameter group, that is fP ffi = fR. t '- We thus want 
to associate to (ft 1 ) a strongly continuous one-parameter group (F t ) = (F t H ) 
of unitary operators on L 2 (R n ) satisfying some additional conditions. We 
proceed as follows: let H be the operator associated to H by the Weyl 

correspondence: .ff H and define C(ff ) = (F t ) by F t = e~ itS / h . The 
Weyl operator H is self-adjoint and its domain obviously contains tS(M n ). 
Let us show that the covariance property (|28p holds. We have 

C(sft H s- 1 )=C(f t Hos - 1 ) 
in view of formula ([lip in Proposition [H that is, by definition of C, 

C(sf t H s- 1 ) = (g-iW^A). 

In view of the symplectic covariance property a o s < — > SAS~ of Weyl 
operators we have H o s _1 = SHS" 1 , and hence 

C{8f?8- X ) = (e-* 8 ™- 1 /*) = (Se- itS / h S- v ) 

which is property (|28j) . 

Let conversely (F t ) be in VU(L 2 (R n )); we must show that we can find a 
unique (ft) in "PHam(2n,lR) such that C(ft) = (Ft)- By Stone's theorem and 
our definition of T'U(L 2 (W 1 )) there exists a unique self-adjoint operator A, 
densely defined, and whose domain contains 5(M n ). Thus A is continuous on 
<S(M n ) and in view of TheoremU]for each value of the parameter r there exists 

an observable a such that A < — > a T . Choose r = 4; then A = H < — > H for 
some function H = H(x,p) and we have C(f^) = (Ft). 

There remains to show that the correspondence C restricts to the meta- 
plectic representation for semigroups (ft) = (st) in Sp(2n,M); but this is 
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clear since (st) is generated, as a flow, by a quadratic Hamiltonian, and that 
the unitary one-parameter group of operators determined by such a function 
precisely consists of metaplectic operators. 



5.3 Proof: the general case 

We now no longer assume that (ft) and (Ft) are one-parameter groups. 
Recall that we have reduced the study of a time-dependent Hamiltonian 
H = H(z, t) to by introducing 

H(x,p,t,E) = H(x,p,t) - E (30) 

which is a time-independent Hamiltonian on IR 2n+2 = IR 2 ^, x x where 

E is viewed as conjug ate variable to t. The flow (f t H ) = (f t H ) on R 2n+2 
generated by H is related to the time-dependent flow (//£/) by the formula 

f t H (z',t',E') = (f t H t ,(z'),t + t',E t , t ,) (31) 

where Ety — E' is the variation of the energy in the time interval [t', t]. The 
advantage of this reformulation of the dynamics associated with H is that 
( ft*) is a one-parameter group of canonical transformations of M 2ri+2 . In the 
operator case we can proceed in a quite similar way, noting that the Weyl 
operator associated with H is given by 

~ d 
H = H(x,-ihV x ,t) -ih—. 

Of course H is self-adjoint if and only if H is, which is the case since H is 
real. We will need the following elementary fact, which is a variant of the 
method of separation of variables: 

Lemma 8 Let E be an arbitrary real number. The function 

V(x,t;t') = ^(x,t)ei E(t - tr) (32) 
is a solution of the extended Schrodinger equation 

ih— = (33) 
if and only if ip = ip(x,t) is a solution of the usual Schrodinger equation 

ih^ = Hi,. (34) 
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Proof. We first note the obvious identity 




(35) 



Writing for short H(t) = H(x, —iKV x ,t) we have, after a few calculations 



hence (j33j) is equivalent to (|M|) in view of (|35|) . ■ 

This result shows the following: choose an initial function ipo = ipo(x) 
at time t = and solve the usual Schrodinger equation (I34p . which yields 
the solution ip = ip(x, t). Then ^ = ^(x, t; t') defined by ([32]) is the solution 
of the extended Schrodinger equation (j33j) with initial datum ^(x,t;t) = 
i{)(x,t) at time t' = t. In terms of flows we can rewrite this as 



6 Conclusion 

In this paper we have shown how one can mathematically derive rigor- 
ously the Schrodinger equation from Hamiltonian mechanics. In that proof 
Banyaga's theorem [2] was seen to play a key role, when used in conjunction 
with the Weyl formalism. In using the latter, it was necessary to introduce 
a scaling factor which we arbitrarily chose to be h even though we made 
no appeal to any quantum process. In this way we have shown that the 
mathematical formalism of the theory of Schrodinger's equation is already 
present in classical mechanics, and is in fact a reformulation of Hamiltonian 
dynamics in terms of operators. So where does quantum physics enter the 
scene? The most obvious question is, of course, how do we give a physical 
meaning to the constant h. That is: Why do we need Planck's constant? 
Let us explore a few possibilities; the list is certainly not exhaustive, and the 
choice has been done in accordance with the present authors' tastes (and 
prejudices!). 

Spectral properties of operators In a sense the most obvious and 
naive way to give a physical motivation for the need of Planck's constant 
is, no doubt, empirical. We have seen that there is a one-to-one corre- 
spondence between Hamiltonian flows and unitary evolution operators solu- 
tions of Schrodinger equations. Now, the spectral properties of the involved 




(36) 
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Hamiltonian operators are well-understood; a basic postulate of traditional 
quantum mechanics is that the eigenvalues of an operator are the values 
that the corresponding observable can take. Using this postulate one can 
thus put in the right value of the constant H by hand. This is an empirical 
motivation, but it seems hard to refute because we know that it works! It 
is irrefutable from an epistemological point of view: putting in h that way 
we are sure to get the right physical answers! 

The Narcowich Wigner spectrum of a mixed state Consider a den- 
sity matrix p: by definition it is a self adjoint non-negative trace class opera- 
tor with trace unity: p = p*,p>0,Trp = l. As is well-known from operator 
theory it is the positivity property p > which is generally the most difficult 
to check; one explicit tool is provided by the Kastler-Loupias-Miracle Sole 
(KLM) conditions [151 OS H], but these imply the simultaneous verifica- 
tion of infinitely many conditions (mathematically the KLM conditions are 
a symplectic variant of Bochner's positivity criteria for the Fourier trans- 
form of a measure). Narcowich |21j and Narcowich and O'Connell [22J have 
noticed the following property of mixed quantum states: the positivity prop- 
erty depends in a crucial way on the numerical value of h. That is, if we 
change h some classical states become "quantum" (that is, representable by 
a density matrix), and some quantum states loose this property (their den- 
sity matrix is no longer non-negative). The set of all values of h for which p 
remains positive is then called the Narcowich- Wigner spectrum of p. 

Information theory Perhaps, after all, the answer ultimately lies in in- 
formation theory. It might very well be that the discrete nature of informa- 
tion is the key for the passage from classical mechanics to quantum theory. 
Paraphrasing Anton Zeilinger |35j : 

In conclusion it may very well be said that information is the 
irreducible kernel from which everything else flows. The question 
why Nature appears quantized is simply a consequence of the fact 
that information itself is quantized. It might even be fair to ob- 
serve that the concept that information is fundamental is very 
old knowledge of humanity, witness for example the beginning of 
gospel according to John: "In the beginning was the Word and 
the Word was with God, and the Word was God". 
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